A machine learning approach to detecting fraudulent job types
نویسندگان
چکیده
Abstract Job seekers find themselves increasingly duped and misled by fraudulent job advertisements, posing a threat to their privacy, security well-being. There is clear need for solutions that can protect innocent seekers. Existing approaches detecting jobs do not scale well, function like black-box, lack interpretability, which essential guide applicants’ decision-making. Moreover, commonly used lexical features may be insufficient as the representation does capture contextual semantics of underlying document. Hence, this paper explores what extent different categorizations classified. In addition, seeks type are most relevant in classifying job. paper, we develop validate machine learning system identifying identity theft, corporate theft multi-level marketing amongst advertisements. We utilized four classes features: empirical rule set-based features, bag-of-word models, recent state-of-the-art word embeddings transformer models various classifiers. The were validated evaluating them on publicly available description dataset. Our results indicate transformer-based consistently outperformed handcrafted rule-set based class. Ultimately, Gradient Boosting classifier with combination parts-of-speech tags bag-of-words vectors achieved best performance an F1-score 0.88.
منابع مشابه
Detecting Encrypted Traffic: A Machine Learning Approach
Detecting encrypted traffic is increasingly important for deep packet inspection (DPI) to improve the performance of intrusion detection systems. We propose a machine learning approach with several randomness tests to achieve high accuracy detection of encrypted traffic while requiring low overhead incurred by the detection procedure. To demonstrate how effective the proposed approach is, the p...
متن کاملDetecting Spam Blogs: A Machine Learning Approach
Weblogs or blogs are an important new way to publish information, engage in discussions, and form communities on the Internet. The Blogosphere has unfortunately been infected by several varieties of spam-like content. Blog search engines, for example, are inundated by posts from splogs – false blogs with machine generated or hijacked content whose sole purpose is to host ads or raise the PageRa...
متن کاملA machine-learning approach to detecting unknown bacterial serovars
Technologies for rapid detection of bacterial pathogens are crucial for securing the food supply. A light-scattering sensor recently developed for real-time identification of multiple colonies has shown great promise for distinguishing bacteria cultures. The classification approach currently used with this system relies on supervised learning. For accurate classification of bacterial pathogens,...
متن کاملDetecting Targets in SAR Images: A Machine Learning Approach
This paper describes a novel application of the MIST methodology to target detection in SAR images. Specifically, a polarimetric whitening filter and a constant false alarm rate detector are used to preprocess a SAR image; then the AQ15c learning program is applied to learn and detect targets. Encouraging and impressive experimental results are provided. KEYWORD: Learning in vision, target dete...
متن کاملA Machine Learning Approach to No-Reference Objective Video Quality Assessment for High Definition Resources
The video quality assessment must be adapted to the human visual system, which is why researchers have performed subjective viewing experiments in order to obtain the conditions of encoding of video systems to provide the best quality to the user. The objective of this study is to assess the video quality using image features extraction without using reference video. RMSE values and processing ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: AI & society
سال: 2022
ISSN: ['0951-5666', '1435-5655']
DOI: https://doi.org/10.1007/s00146-022-01469-0